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Abstract. Let A be a subset of F™ , the n-dimensional linear space over the prime field W p of size 
at least SN (N — p n ), and let S v = P _1 (w) be the level set of a homogeneous polynomial map 
P : F™ Fp of de gree d, and v £ Fp. We show, that under appropriate conditions, the set A 
contains at least c N\S\ arithmetic progressions of length I < d with common difference in S v , where 
c is a positive constant depending on 8, I and P. We also show that the conditions are generic for 
a class of sparse algebraic sets of density « N~ £ . 



1. Introduction. 

1.1. Background. A famous result of Szemeredi j9] states that a set A of positive upper density 
of the integers contains arbitrarily long arithmetic progressions x, x + d, . . . , x + Id. There has 
been many generalizations and extensions, a natural question one may ask is whether one may add 
restrictions on the common difference d. A typical example of this type is the well-known theorem 
of Sarkozy [8], saying that A contains two elements whose difference is a square. More recently 
Green has shown [3J that A contains a 3-term arithmetic progression, whose common difference is 
a sum of two squares. Far reaching results of this type for longer progressions have been obtained 
recently by Green and Tao [5] and by Wooley and Ziegler |12| where the gap is of the form p — 1 
and P(p — 1), p being a prime and P an integral polynomial such that P(0) = 0. 

The aim of this note is to provide a simple extension of this type in the finite field settings, where 
A C F™ is a set of density 5 > and one is counting arithmetic progressions in A with gaps in 
algebraic sets S given as level sets of a family of homogeneous polynomials. 

1.2. Main results. Let F" be the n-dimensional linear space above the prime field ¥ p , and for a 
fixed 5 > let A C F™ be a set of at least \A\ > Sp n elements. The finite field version of Szemeredi's 
theorem states that such sets will contain genuine arithmetic progressions of length I < p as long 
as n is large enough. Note that the condition I < p is natural as it ensures that progressions in 
the form {x, x + y, . . . , x + (Z — l)y} consist of distinct points for y ^ 0. We will need the following 
quantitative version 

Theorem A. Let 5 > and I G N. For a function f : F™ — ^ [0, 1] satisfying E(f(x) : x € F") > 5 
one has that 

E(f(x)f(x + y)...f(x+(l-l)y): x,y G F") > c(6,l,p), (1.1) 

where c(8,l,p) is a positive constant depending only on 5, I and p. 

Let S v = P _1 (u) be an algebraic set defined as the level set of a family of homogeneous polyno- 
mials P = (Pi, . . . , Pr) : F^ — > of de gree d, v € F^ being a given vector. We will be interested 
in counting Z-term arithmetic progressions in A with common difference y in S v . It is clear that in 
order to make this problem well-defined one needs to make a few assumptions. First S v needs to 
be nonempty, and in order to have a more precise formula for the size of S v , a natural assumption 
is that the associated set of singular points 

S* P := {x G F™ : rank (Jac P (x)) < R, } (1.2) 
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is small. Here Jacp(x) is the R x n matrix with entries d Xj Pi(x). This will be done by requiring 
that 

K := codim(Sp) (1.3) 

is sufficiently large. Note that the dimension of an algebraic set is defined above the algebraic closure 
of the prime field F p . To avoid degeneracies like the identical vanishing of certain derivatives we'll 
also assume that d < p. Let us introduce the parameters < a,/3,7 < 1 by 

7 n = R, (3n = K, p a = d (1.4) 

Our main result is the following. 

Theorem 1. Let 5 > 0, e > be given, and let A C F™ be a set of size \A\ > 5p n . Let the 
polynomial map P and the parameters < a, f3, 7 < 1 be defined as above. Then for I G N, I < d 
one has uniformly in v G Fp that 

E(l A (x)l A (x + y)...l A (x + (l-l)y) : x e¥%, y e S v ) > c(e,6,l,p), (1.5) 

provided that 

/3-a-(2 d + l) 7 >e, (1.6) 

with a constant c(e,5,l,p) > depending only on e,5,l and p. Here 1 A stands for the indicator 
function of the set A, and S v = P _1 (w). 

Remarks: 

• If n is large enough, it follows that A contains ~ |Fp||/S w | non-trivial progressions of length 
I with common difference y G S v . In particular for every v G F^ there is a progression with 
common difference y such that P{y) = v. As a byproduct of the proof we also obtain that 
\S V \ ~ p n ~ R , uniformly in v, thus is a "sparse" set of density of ~ p~ 7 ™. 

• The condition I < d seems necessary, as it can be seen from the following example in [2] 
adapted to the finite field settings. Let d = 2, R = 1 and P(x) = x\ + . . . + x\. Fix 
p > 2 (say p = 5) and let A = {x G F p : P(x) = 0}, then A has density ~ p^ 1 . By 
the parallelogram identity: P(x) — 2P(x + y) + P(x + 2y) = 2P(y), if ^4 contains a 3-term 
arithmetic progression {x, x+y, x+2y} then necessarily P(y) = 0. One can construct similar 
examples for the polynomials Q(x) = ^ • Xj of degree d for all d > 2. These examples also 
show the necessity of a condition on the singular set. Indeed (1.5) does not hold for the 
level sets of P(x) = (x\ + . . . , +x^) d ^ 2 (d > 2 even) for I > 2, while it dose hold for the 
level sets of Q(x) = xf + ... for I = d. The difference is that S P is n — 1-dimensional 
while S* Q = {0}. 

• For the special case, when v = the set A contains non-trivial arithmetic progressions 
with gap y G S = P _1 (0) of length / > d, under the more restrictive conditions that 
R < c(5, l,p, d) n. This is based on the fact that zero set of homogeneous polynomial maps 
contain a large linear subspace, which follows from a theorem of Chevalley and Warning 
[1], see Thm. 6.11. This will be discussed in Section 4. 

We will also study polynomial maps P for which the conditions of Theorem 1 hold. Consider 
first diagonal forms, when Pi(x) = Y^j=i a ij x ji A = {aij}i<i<_R,i<j<n being an R x n matrix. We 
say that the matrix A is non- degenerate if rank A' = R for every R x n/2 submatrix of A. 

Claim 1. If A is non- degenerate then dimSp < n/2. 
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Proof. For 1 < j < n let &j be the j-th column of the matrix A. Then the jth column of the 



Jacobian Jacp(x) is dxj 1 &j at x = (x%, . . . ,x n ). If x has at least n/2 nonzero coordinates, say 
Xj 1 , . . . , Xj m then the corresponding columns of Jacp(x) span hence rank (Jacp(x)) = R. Thus 
S* p is contained in the union of the n/2-dimensional coordinate hyperplanes. □ 

It is easy to see that most Rxn matrices are non-degenerate. Let A be a random matrix obtained 
by choosing each of its column vector aj independently with probability p~ R . 

Claim 2. Let 70 := | — j^rf- If p > 5, R = with 7 < 70, then the probability that a random 
matrix A is non-degenerate is at least 1 — p~^to-i)n_ 

Proof. For a given subspace M < F™ of codimension 1, the probability that the rows a^, aj 2 , . . . , aj m 
of the random matrix A are all contained in M is p~ m . Thus the probability that M contains at 
least n/2 columns of A is less than 2 n p~ n / 2 . Since there are p R distinct R— 1 dimensional subspaces, 
the probability that none of them will contain at least n/2 columns of A is at least 

1 _ 2 ? V ? -f > 1 _p-(70-7)n. 

In that case A is non-degenerate. □ 

This implies that condition (1.6) holds for such maps as long as a + {2 d + 1)^ < 1/2 — e. In fact we 
will show that it also holds for generic (non-diagonal) polynomial maps as long as a and 7 is chosen 
sufficiently small. More precisely, let V(n, d) be the space of homogeneous polynomials P : F™ — > F p 

of degree d, which is an N = ( n+ ^ X ) dimensional linear space over ¥ p , the algebraically closed 
field of characteristic p. Then F(n, d) R is the space of i?-tuples of such polynomials. One has 

Proposition 1. Let l<R<L<nbe given. Then the locus of polynomial maps P G F(n, d) R 
such that dimSp > L is contained in an algebraic set of codimension at least L — 2R + 2. 

This mean that for generic polynomial maps P the dimension of the singular variety satisfies the 
bound: dimSp < 2R — 2. If R = 771 then for generic maps one can take /3 = 1 — 27, thus (1.6) 
holds if: a + {2 d + 3)7 < 1 - e. 

1.3. Outline of the Proof. To begin we shall need a generalized von Neumann inequality for 
restricted progressions. To state it, let us briefly recall the Gowers uniformity norms. The multi- 
plicative derivative of a function / is given by Ahf(x) = f(x + h)f(x), and higher derivatives as 
hi = ^/ii(^/n,..,/ii_i)' The [7 z -Gowers norm is then given by 

\\f\\%=-E(A hl ,..., hl f(x):x,h 1 ,...,h l e¥;). 

This norm represents the average of / over 'cubes' in F^, sets of the form 

{x + Lu%hi + ... + uj t hi : ux, ...,oji £ {0, 1}'}, 

and is indeed a norm for integers I > 1 (I = 1 provides a semi-norm). On fact we shall need is the 
monotonicity formula 

ii/iic/'-! < \\i\y- (1.7) 

For the above definitions and facts on may consult [TO] • For a pair of functions /, g : F™ — > C define 
the form 

Kl{f,g) = E(/(x)/(x + r)...f(x + (I - l)r)g(r) : x, r G F"). 

Lemma 1. (Generalized von Neumann inequality) 

For functions f, g bounded in absolute value by one, one has 

M(f,g) < Ibllc/J- 
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To apply this result in combination with Theorem A, one finds an appropriate balanced function 
of the level set S = say g = Is — p for an appropriate constant p, and writes 

K l (f,l s ) = pAif + A l (f,g). (1.8) 

The first term applies to Theorem A, while the second can be bounded by Hsllf/i by Lemma [TJ 
Then it remains to show that p can be chosen properly as to give ||<7||j/i small. Thus the crucial 
step is to obtain the following bound, which may be of interest on its own. 

Proposition 2. Let v G F^ and let S = P _1 (u), where P = (Pi,...,P R ) : F£ ->• F^ is a 

homogeneous polynomial map of degree d. Then one has 

\\i s - P - R \\ ud < (d - ir dn P 2 - d ( R - K \ (i.9) 

where K = codim(Sp) , S* p being the singular variety associated to P. 

Proof of Theorem[l\ Let the parameters < a,/3,7 < 1 be defined as in (1.4) and assume that 
condition (1.6) holds. First, note that by (1.9) and (1.6) we have that 

\p- n \S\ - P - R \ = ||l 5 -jr ,l || c ,i < p«(«+7-/3)2~ d < p -en p -R 

thus in particular \S\ = p n ~ R (l + 0{p' £n )). 

Let / = 1a, p = p~ R , g = Is ~ P~ R , then by (1.8) 

E (1a(x)1 a (x + y)... l A (x + (I - l)y) : x£F;,y£S) = 

= P^ 1 (p-%/ + Hf, g)) = (1 + 0{p- £n )) (A,/ + P R Hf, g)) 
By Theorem A the first term satisfies 

A/>c(5,/,p) 

while by (1.9) and (1.6) the second term is at most 

p R Hf,g)) <p R p^ a+ ^ )2 ' d <p- en 

This implies that the left side of (1.5) is at least c(5,l,p)/2 for n > n(e, S, l,p), while it is trivially 
at least p~ n for all n's. This proves Theorem 1. □ 

It remains to prove Proposition [21 The starting point is the identity 

l s (x) =E(e(a- (P(x) -v) : a e¥ R ) =p- R + p- R ^ e(-a ■ v) e(a ■ P(x)). 

The triangle inequality for the Gowers norms then gives 

lbll^<^ R Yl (M a - p (*))\\v*, 

reducing our task to bounding ||e(a • P(x))\\ud for a nonzero a. 

We apply the method of Birch pQ to achieve a bound in terms of an explicitly given algebraic 
set W* C Fp d 1 ^ n , which will be discussed in detail in the next section. 

Lemma 2. If a ^ in F™, then we have 



\W* 
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The set W* appears in the work of Birch on exponential sums pQ and is closely related to the 
singular set Sp, see (2.6) below. In particular if one embeds F™ into the diagonal A C Fp d 1 ' n 
via the map &(h) = (h,...,h), then &(S P ) = W* f}A. As A is a linear subspace of codimension 
(d - 2)n, it follows that dim(W*) < [d - 2)n + dim(S* P ). Thus, codim(W*) > codim(S* P ) = K. 
Also W* can be partitioned into algebraic sets W^, (A € F^\{0}) such that W£ is defined by n 
equations of degree d — 1. Then basic facts from algebraic geometry give 

Lemma 3. With W* as above, we have \W*\ < (d — l) n p R p^- 1 )™-^ . 

Thus Proposition [2] follows save for the proofs of Lemma 2 and Lemma 3. 

2. Exponential sum estimates. 

Let P = (Pi, . . . , P R ) : F™ -> F^ be a polynomial map, Pj being a homogeneous polynomial of 
degree d, written in the symmetric form 

Pi{%) = ^ ^ a ji---jd ' ' ' X 3di X = • • • i-^n) (2-1) 
l<iii— dd<n 

where a 1 ^ ^ = aj,.^ n (j d \ for any permutation ir : {1, ...,d} — > {l,...,d}. Note that this is 
possible as (d\,p) = 1. For h € F™ define the differencing operator 

Z^POr) = P i (x + / l ) -P(x), (2.2) 

and note that cfeg {DhP) = deg (P) — 1. After applying the differencing operators d — 1 times one 
obtains a linear function of the form 

n 

D h i... D hd -iPi(x) »;(/r /i"" 1 )^, (2.3) 

3=1 

where ^(/i 1 , . . . , h d ~ l ) is the multilinear form 

I*' ') ' /! £ "},..,,„ >>). ■■■!>", '• (2-4) 

l<ii>— Jd<" 

for any (d — l)-tuple of vectors (h 1 , . . . , h^ 1 ) € Fp rf 1 . Note that on the diagonal 

$)(h,...,h) = (d-iy. d x .Pi(h) (2.5) 

Define the set W* associated to the polynomial map P by 

W* = {{h 1 , h d ~ l ) £ F^ 1 ) 71 : rank ($(/i\ h d ~ 1 )) < R}, (2.6) 
where ^(h 1 , . . . , h^ 1 ) is the R x n matrix with entries ^(h 1 , . . . , h^ 1 ) for 1 < i < R, 1 < j < n. 

Proof of Lemma 2. Using the definition of U d norm: 

\\e(a-P)\\ ud =E(A hl _ hd e(a-P(x): x, hi, h d e F£) (2-7) 
= E(e(a • D hl _ hd P(x)) : x,h u /i d G F"). 

From (2.2) (2.3) and the definition of the matrix $(/i 1 , . . . , /i^" 1 ) it is clear that 

& hl ,...,hA a ■ p ( x )) = <^ T {h\ h d ~ l )a ■ h d ) (2.8) 

where <3? T is the transpose of the matrix <1> and "•" is the dot product. If rank (^(/t 1 , . . . , h^ 1 )) = R 
then <& T (h 1 , . . . , h d ~ l ) a ^ hence summing (2.8) in the h d variable vanishes. Thus only the tuples 
(h 1 , . . . , h^ 1 ) G W* contribute to \\e(a ■ P)\\u<i and Lemma 2 follows. □ 
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If rank (^(/i 1 , . . . , h d 1 )) < R then its rows . . . , <3? R are linearly dependent, thus one may 
write 

w*= |J W*(X 1 ,...,X R ), 

(Ai,...,Afl)^0 

where for A = (Ai, . . . , Xr) ^ 

W*(X) = {(h\ . . . , h d - 1 ) G ¥f-^ n : Ai^^/i 1 , . . . , h d - 1 ) + ... + X R <S> R (h\ h d ~ l ) = 0}. 

(2.9) 

Note that W*(X) is a homogeneous algebraic set defined by n equations of degree d — 1. To 
estimate the size these sets we need the following basic facts from algebraic geometry. 

Lemma 4. [6] For a homogeneous (affine) algebraic set U C F™ of degree r and dimension s one 
has that 

\U\ < rp s (2.10) 

The degree of the set U is defined as the degree of its image U in the m— 1-dimensional projective 
space above ¥ p . For projective algebraic sets it is shown in [6j, Prop. 12.1, that \U°\ < r7r s (F p ) 
where 7r s (F p ) = (p s — \)/{p — \) is the size of the s — 1-dimensional projective space. This implies 

(2.10) as \U\ = (p- 1)\U°\ + 1. 

Lemma 5. If a homogeneous algebraic set U is defined by n equations of degrees d\, . . . ,d n then 
its degree is bounded by 

deg (U) < dtd 2 . . . d n (2.11) 

The degree of a (projective) algebraic set is defined as the sum of degrees of its irreducible 
components, and for a projective algebraic variety it can be defined geometrically as the number 
of intersection points with a generic subspace of complementary dimension or algebraically, see [7] 
Prop. 7.6 and the definition preceding it. Note that the degree of a hypersurface is the degree of 
its defining polynomial. Lemma 5 may be viewed as a generalization of Bezout's theorem on the 
number of intersection of plane algebraic curves, written as an inequality ignoring the multiplicities 
of the intersection points. It follows easily from the following basic inequality which is an immediate 
corollary of Thm. 7.7 in [7]. 

Theorem B. Let Y be a (projective) variety of dimension at least 1 and let H be a hypersurface 
not containing Y . Let Z\, . . . , Z s be the irreducible components of the intersection Y n H . Then 

s 

deg (Zi) < deg (Y) deg (H) (2.12) 

i=i 

Proof of Lemma 5. One may write (2.12) as 

deg (Y n H) < deg (Y) deg (H) (2.13) 

as long as Y H and dimY > 1. However for Y C H or when dimY = (Y being a point and 
deg (Y) = 1) inequality (2.13) holds trivially. It also extends to algebraic sets V by writing them 
as union of their irreducible components Y and using (2.13) for each Y together with the fact that 
each irreducible component of V n H is contained in some Y n H for an irreducible Y C V . Then 

(2.11) follows immediately by induction on n. □ 

Proof of Lemma 3. For a given A G F^ \{0} one has that dimW*(X) < dimW* < (d-l)n-K, and 
since it is defined by n equations of degree d— 1 its degree satisfies the bound deg (W*(A) < (d— l) n . 
Hence by (2.10) and (2.11) one has for all A G F^\{0} 

\W*(X)\ < (d-l) n p^ n ~ K 
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and Lemma 3 follows from the decomposition W* = Li\W*(X). □ 

3. Generic polynomial maps. 

We will work above the algebraically closed field of characteristic p, denoted by F p . Though it 
is best to view the arguments below by identifying homogeneous algebraic sets with their image in 
the projective space, we will keep to the affine settings. Let V(n, d) be the space of homogeneous 
polynomials P : F™ —> ¥ p of degree d, which is the N = ( n+ ^ _1 ) -dimensional linear space over ¥ p . 
The critical locus C C P(n, d) x F™ as the set of pairs (P, x) such that x G Sp, the singular variety 
of Sp = P _1 (0), that is d Xl P(x) = ... = d Xn P(x) = 0. Its image under the natural projection 
7r : V(n,d) x F™ -^V{n,d) is the discriminant T> = T>(n,d), which is the locus of polynomials P so 
that Sp is singular. It is known that C is a connected smooth variety of dimension equal to that of 
D, which is a hypersurface in V(n,d), see [IT] . Cor. 2 and Cor. 5. 

For given K > 1 let T>k = ^(n,^) be the locus of polynomials P such that dimSp > K, and 
let Ck = ^~ 1 (Dr) be its pre-image. The set Dk is a Zarisky closed set, moreover the pre-image 
7r _1 (P) = Sp of every point P G T>k has dimension at least K (at least dimension K — 1 in the 
projective settings). This implies that 

dim T>k < dim C — K + \ = N — K. (3.1) 

Thus T>k has codimension at least K. 

Proof of Proposition d For given R let V(n,d) R the space of homogeneous polynomial maps P = 
(Pi,...,Pr) : F ™ -> F p of degree d. If x G S* p then by definition (1.2), these exists a nonzero 
H = (/ii, . . . , [Mr) G F^, such that x G Sp , where P M = /xiPi + . . . + ^rPr- Note that Sp^ = Sp^ 
if ji = Xji 1 for a scalar A 7^ 0. Thus 



S P C (J <? Pfi , (3.2) 

where IL^ -1 is the R — 1 dimensional projective space above F p , considered as an algebraic set in 
Fp. This implies that for a given L > 2P, if dim S P >L then there must exist a \i 7^ such that 
dim Sp M > L — P + 1. 

Let $ : ILp^ 1 x V(n,d) R -> P(n, d) be the map defined by $(ju,P) = P M , and let ?r : IL^ 1 x 
V(n, d) R — > V(n, d) R be the natural projection. Then the locus of polynomial maps P for which 
dim Sp > L is contained in 

{P G P(n, d) R : dim 5p > L} C vr($- 1 D^) (3.3) 

with K = L — R + 1. The tangent map d^u P \ is onto at every point (/U, P) where /i 7^ 0, thus 
the codimension of the algebraic set is at least X. The projection tt cannot increase the 

dimension, hence the codimension of 7t(<&~ 1 Dk Q ^(n, d) R is at least K — R + 1. If L > 2R — 1 
then the set of polynomial maps P for which dim Sp > L has codimension at least L — 2R + 2 > 1, 
thus is contained in a proper algebraic set. This proves Proposition [TJ □ 

4. Linear subspaces in homogeneous varieties. 

We will show that a homogeneous variety S = P -1 (0) contains a large linear subspace M, as an 
easy corollary of the following result due to Chevalley and Warning ([4], Thm. 6.11) 

Theorem C. Let Qi, ■ ■ ■ ,Qt '■ F™ — > ¥ p be polynomials of degree d\,...,dt such that D = d\ + 
. . . + dt < n, and Qi(0) = for all i. If Sq is the common zero set of the polynomials Qi then 

\S Q \> P n - D (4.1) 
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Proposition 3. Let S = P 1 (0) ; where P : F™ — >■ a homogeneous polynomial map of degree 
d < p. Then S contains a linear subspace M such that 

dimM > c d {n/R)^ (4.2) 
with a constant c d > depending only on d. 

Proof. Let M be a maximal subspace, such that M C S. Let h±, . . . , h m be a basis of M. One may 
write Pi{x) = Qi(x, . . . , x) where Qi(x±, . . . , Xd) is a symmetric multi-linear form, as in (2.1). If h is 
such that Qi(h, . . . , h, hi k+1 ,hi d ) = for all 1 < k < d, 1 < i < R, and 1 < ik+i < ■ • • , < id < Tn, 
then M' = M + F p /i C 5 as well. For fixed k this gives i? (™) homogeneous equations of degree k. 
The sum of degrees D of all these equations is bounded by 

D < C d Rm d (4.3) 

By the Chevalley- Warning's Theorem, the number of such h is at least p n ~ D . If m < Cd (m/R) 3, 
then p n ~ D > p m = \M\. Thus one may choose h £ M such that M + F p /i C 5 contradicting our 
assumption. This proves the Proposition. □ 

Corollary 1. Let A C F™ o/ density 8 > 0, and let S = P _1 (0), where P : F™ — > a homogeneous 
polynomial map of degree d < p. If 

R<c(5,l,d,p)n (4.4) 

t/ien yl contains an arithmetic progression {x, x + y, . . . , x + (I — l)y} with common difference 
yeS\{0}. 

Proof Let M be a maximal subspace contained in S. Let M + Xi be a translate of M such that 
the relative density 8i = \ (An (M + Xi)\/\M\ of A on m + x% is at least <5. If the dimension m of M 
is large enough: m > m(8,l,p) then A n (M + Xj) contains a non-trivial arithmetic progression of 
length I, whose gap y is then in M C S 1 . By (4.2) this happens if {n/R) l l d > m(8,l,d,p) for which 
it is enough to assume (4.4). □ 
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